In a comparative study, the q-exponential and Weibull distributions are employed to investigate
frequency distributions of basketball baskets, cyclone victims, brand-name drugs by retail sales,
and highway length. To analyze the intermediate cases, the q-Weibull distribution, which
interpolates between the q-exponential and Weibull ones, is introduced. It is verified that the
basketball baskets distribution is well described by a q-exponential, whereas the cyclone victims
and brand-name drugs by retail sales distributions are better fitted by a Weibull distribution.
For highway length, on the other hand, neither the q-exponential nor the Weibull distribution
gives a satisfactory fit, and the q-Weibull distribution is required. Furthermore, the
interpolating distribution sheds light on the controversy between the stretched exponential
and the inverse power law (q-exponential with q > 1).

PACS numbers: 89.20.-a, 89.65.-s, 89.90.+n



I. INTRODUCTION 

Frequency distributions have played an important part
in many investigations in very different contexts, for
example the distribution of income [1], the frequency of
words in a long text [2], forest fires [3], scientific
citations [4], world-wide web surfing [5], ecology [6],
solar flares [7], population distribution in large
cities [8], economic index [9], and epidemics in isolated
populations [10], among others. In these cases power law
distributions, $p_p(x) = p_0 x^{-\alpha}$ with
$\alpha > 1$, have been identified. In general, this
distribution cannot be used for arbitrary x values, since
the number of events described by a power law is not
finite when all positive x values are considered. Thus, it
is natural to investigate deviations from a power law,
which can be identified as a curvature in log-log plots.

One of the simplest generalizations of a power law
for which $\int_0^\infty p(x)\,dx$ is finite is the
Zipf-Mandelbrot law, $p_{zm}(x) = p_0/(1 + sx)^{\alpha}$
$(s > 0)$, which was employed by Mandelbrot to investigate
the frequency of words in long texts [11]. More recently,
this distribution was generalized in order to incorporate
negative $\alpha$ values, i.e.,

$$p_q(x) = p_0 \exp_q(-x/x_0), \tag{1}$$

where $\exp_q(-x) = [1 - (1-q)x]^{1/(1-q)}$ if
$1 - (1-q)x \ge 0$ and $\exp_q(-x) = 0$ if
$1 - (1-q)x < 0$. Thus, for q > 1 we have a long-tailed
function, since $p_q(x) \propto x^{-1/(q-1)}$ for
$x \gg x_0/(q-1)$; for q < 1 we obtain a short-tailed
function, since $p_q(x) = 0$ for $x > x_0/(1-q)$; and for
q → 1 we have an exponential, since
$\lim_{q \to 1} \exp_q(-x) = \exp(-x)$. Note also that a
convenient identification of the parameters in Eq. (1)
immediately leads to the Zipf-Mandelbrot law when q > 1.
This distribution for q > 1 has been employed, for
instance, in connection with scientific citations [12],
goal distribution [13], and population distribution in a
country [14].
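The three regimes of the q-exponential described above can be checked numerically. The following is a minimal Python sketch, not code from the original work; the function names `q_exp`, `q_exponential`, and `log_log_slope`, the tolerance used to detect q = 1, and the test values are all illustrative assumptions.

```python
import math


def q_exp(u, q):
    """q-exponential exp_q(u) = [1 + (1 - q) u]^(1/(1 - q)), meant for u <= 0.

    Returns 0 when the bracket is non-positive (the cutoff for q < 1) and the
    ordinary exponential when q is numerically equal to 1.
    """
    if abs(q - 1.0) < 1e-12:
        return math.exp(u)
    base = 1.0 + (1.0 - q) * u
    if base <= 0.0:
        return 0.0
    return base ** (1.0 / (1.0 - q))


def q_exponential(x, q, x0=1.0, p0=1.0):
    """p_q(x) = p0 * exp_q(-x / x0)."""
    return p0 * q_exp(-x / x0, q)


def log_log_slope(f, x1, x2):
    """Slope of f between x1 and x2 on a log-log plot."""
    return (math.log(f(x2)) - math.log(f(x1))) / (math.log(x2) - math.log(x1))


# q < 1: short tail, the function vanishes beyond x0 / (1 - q) = 2.
beyond_cutoff = q_exponential(2.5, q=0.5)

# q > 1: long tail, the log-log slope approaches -1/(q - 1) = -2.
tail_slope = log_log_slope(lambda x: q_exponential(x, q=1.5), 1e6, 2e6)

# q -> 1: the ordinary exponential is recovered.
near_one = q_exponential(1.0, q=1.0 + 1e-6)

print(beyond_cutoff, tail_slope, near_one, math.exp(-1.0))
```

Treating q = 1 as a separate branch avoids the division by zero in the exponent 1/(1 - q); the threshold 1e-12 is an arbitrary choice.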

Eq. (1) can be viewed as a generalization of the
exponential function, since it basically replaces the
exponential in the canonical ensemble of Tsallis
statistics [15, 16]. More precisely, $p_q(x)$ can be
obtained from a maximum entropy principle. These facts
motivate us to refer to $p_q(x)$ as the q-exponential
distribution. We remark that Tsallis statistics has been
employed in connection with several anomalous systems (for
a recent review see Ref. [17]).

The Weibull distribution

$$p_w(x) = p_0 \frac{r x^{r-1}}{x_0^r} \exp\left[-\left(\frac{x}{x_0}\right)^r\right] \tag{2}$$

has been employed to account for a systematic curvature
in log-log plots. The parameter r is taken positive, and
when r = 1 Eq. (2) reduces to an exponential. The
distribution (2) was first employed in a statistical
theory of the strength of materials [18] and has been used
in many contexts, such as dielectric failure [19], cardiac
contraction [20], aspects of road-accident deaths [21],
wind speed [22], profile of foliage area [23], and
porcelain strength [24], among others. In particular, this
distribution has been related to multiplicative processes
by Laherrere and Sornette [25], who refer to $p_w(x)$ as a
stretched exponential.

*Electronic address: lcmala@dfi.ucm.br

It must be stressed that the parameters $x_0$ and r
control the tail behavior of $p_w(x)$, just as $x_0$ and q
dictate the shape of $p_q(x)$. Furthermore, if we restrict
our analysis to a finite range of x, it is very common
that $p_w(x)$ and $p_q(x)$ fit a restricted set of data
equally well. Thus, it is natural to investigate which
distribution, $p_w(x)$ or $p_q(x)$, gives a better fit for
a large set of data. This work addresses this question by
focusing on the frequency distributions of basketball
baskets in a championship, tropical cyclone victims,
brand-name drugs by retail sales, and highway length.
In particular, for the highway length distribution neither
the q-exponential nor the Weibull distribution leads to a
satisfactory fit. In order to accommodate this case and
the others in a unified framework, we introduce a
distribution which smoothly interpolates between the
q-exponential and the Weibull ones. Moreover, this
approach sheds light on the controversy between the
stretched exponential and the inverse power law
(q-exponential with q > 1).



II. q-WEIBULL DISTRIBUTION

We refer to this interpolating function as the q-Weibull
distribution, and it is given by

$$p_{qw}(x) = p_0 \frac{r x^{r-1}}{x_0^r} \exp_q\left[-\left(\frac{x}{x_0}\right)^r\right]. \tag{3}$$

Note that for r = 1 and q ≠ 1 we recover the
q-exponential. Analogously, in the limit q → 1 with r ≠ 1
or r = 1, we obtain the Weibull or the exponential
distributions, respectively. Furthermore, when q > 1 and
$x \gg x_0/(q-1)^{1/r}$, we verify that
$p_{qw}(x) \propto x^{-\nu}$ with
$\nu = r[(2-q)/(q-1)] + 1$. In contrast, when q < 1 and
$x > x_0/(1-q)^{1/r}$, we have $p_{qw}(x) = 0$.
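The limiting cases and the tail exponent stated above can be verified numerically. The sketch below is only an illustration in Python under stated assumptions: the density is taken as p0 (r x^(r-1) / x0^r) exp_q[-(x/x0)^r], and the helper names (`q_exp`, `q_weibull`, `weibull`, `tail_exponent`) and all parameter values are hypothetical choices, not taken from the fits.

```python
import math


def q_exp(u, q):
    """exp_q(u) = [1 + (1 - q) u]^(1/(1 - q)); 0 past the cutoff, exp(u) at q = 1."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(u)
    base = 1.0 + (1.0 - q) * u
    return base ** (1.0 / (1.0 - q)) if base > 0.0 else 0.0


def q_weibull(x, q, r, x0=1.0, p0=1.0):
    """p_qw(x) = p0 * (r x^(r-1) / x0^r) * exp_q(-(x / x0)^r), for x > 0."""
    return p0 * r * x ** (r - 1.0) / x0 ** r * q_exp(-((x / x0) ** r), q)


def weibull(x, r, x0=1.0, p0=1.0):
    """p_w(x) = p0 * (r x^(r-1) / x0^r) * exp(-(x / x0)^r), for x > 0."""
    return p0 * r * x ** (r - 1.0) / x0 ** r * math.exp(-((x / x0) ** r))


def tail_exponent(q, r):
    """nu = r (2 - q) / (q - 1) + 1, the asymptotic power-law exponent for q > 1."""
    return r * (2.0 - q) / (q - 1.0) + 1.0


# Limit q -> 1 at fixed r: the Weibull distribution is recovered.
qw_near_weibull = q_weibull(3.0, q=1.0 + 1e-7, r=0.7, x0=2.0)
w_value = weibull(3.0, r=0.7, x0=2.0)

# r = 1: a q-exponential in x / x0 (with a constant factor 1 / x0).
qw_r1 = q_weibull(3.0, q=1.4, r=1.0, x0=2.0)
qexp_value = q_exp(-3.0 / 2.0, 1.4) / 2.0

# q > 1: log-log slope measured far in the tail, compared with -nu.
q, r = 1.4, 0.8
x1, x2 = 1e8, 2e8
slope = (math.log(q_weibull(x2, q, r)) - math.log(q_weibull(x1, q, r))) / math.log(x2 / x1)

print(qw_near_weibull, w_value)
print(qw_r1, qexp_value)
print(slope, -tail_exponent(q, r))
```

The slope is measured at x much larger than x0/(q-1)^(1/r), where the constant 1 inside the bracket of exp_q is negligible.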

In order to obtain a sufficiently smooth curve, it is
common practice to employ the cumulative distribution

$$R(x) = \int_x^\infty p(y)\,dy \tag{4}$$

to investigate the frequency distribution of a set of data.
In our case, we obtain

$$R_{qw}(x) = p_0' \exp_{q'}\left[-\left(\frac{x}{x_0'}\right)^r\right], \tag{5}$$

with $q' = 1/(2-q)$, $x_0' = x_0/(2-q)^{1/r}$, and
$p_0' = p_0/(2-q)$. It must be emphasized that
$R_{qw}(x)$ exists only if q < 2. Moreover, from our
construction, the cumulative distributions $R_q(x)$ and
$R_w(x)$ follow from Eq. (5) by taking the appropriate
limits (r → 1 or q → 1). Therefore, the cumulative
distribution of a power law is again a power law, that of
a q-exponential is another q-exponential, that of a
Weibull distribution is a stretched exponential, and that
of a q-Weibull is a q-stretched exponential.
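As a consistency check of the closed form for the cumulative distribution, one can integrate the q-Weibull density numerically and compare. The Python sketch below assumes the density p0 (r x^(r-1) / x0^r) exp_q[-(x/x0)^r] and the primed parameters q' = 1/(2-q), x0' = x0/(2-q)^(1/r), p0' = p0/(2-q). The Simpson integrator, the function names, and the lower limit x = 100 are illustrative choices; the parameter values are the basketball q-Weibull entries of Tab. I, used only as an example with q < 1, where the support is finite.

```python
import math


def q_exp(u, q):
    """exp_q(u) = [1 + (1 - q) u]^(1/(1 - q)); 0 past the cutoff, exp(u) at q = 1."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(u)
    base = 1.0 + (1.0 - q) * u
    return base ** (1.0 / (1.0 - q)) if base > 0.0 else 0.0


def q_weibull(x, q, r, x0, p0):
    """Density p0 * (r x^(r-1) / x0^r) * exp_q(-(x / x0)^r), for x > 0."""
    return p0 * r * x ** (r - 1.0) / x0 ** r * q_exp(-((x / x0) ** r), q)


def primed_parameters(q, r, x0, p0):
    """(q', x0', p0') of the cumulative distribution; defined only for q < 2."""
    if q >= 2.0:
        raise ValueError("the cumulative distribution exists only for q < 2")
    return 1.0 / (2.0 - q), x0 / (2.0 - q) ** (1.0 / r), p0 / (2.0 - q)


def q_weibull_cumulative(x, q, r, x0, p0):
    """R_qw(x) = p0' * exp_q'(-(x / x0')^r)."""
    qp, x0p, p0p = primed_parameters(q, r, x0, p0)
    return p0p * q_exp(-((x / x0p) ** r), qp)


def simpson(f, a, b, n=20000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


q, r, x0, p0 = 0.51, 0.91, 387.0, 601.0
x_cut = x0 / (1.0 - q) ** (1.0 / r)      # the density vanishes beyond this point
x = 100.0

numeric = simpson(lambda y: q_weibull(y, q, r, x0, p0), x, x_cut)
closed = q_weibull_cumulative(x, q, r, x0, p0)
print(numeric, closed)
```

For q < 1 the upper integration limit can be replaced by the cutoff, which keeps the quadrature simple; for 1 < q < 2 the tail would have to be handled separately.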

Before starting our empirical investigation of data by
using these distributions, we would like to remark that
the functions $R_q(x)$, $R_w(x)$, and $R_{qw}(x)$ can be
related to diffusion equations. In fact, $R_q(x)$
satisfies the generalized decay equation
$dR_q(x)/dx = -a[R_q(x)]^{q'}$ with
$a = (p_0')^{1-q'}/x_0'$; $R_w(x)$ is intimately related
to the point-source solution of the anomalous diffusion
equation proposed by O'Shaughnessy and Procaccia [26]; and
$R_{qw}(x)$ occurs in the solution of the nonlinear
diffusion equation proposed in Ref. [27]. Moreover,
$R_{qw}(x)$ obeys the generalized decay equation
$dR_{qw}(x)/dx = -b\,x^{r-1}[R_{qw}(x)]^{q'}$ with
$b = r(p_0')^{1-q'}/(x_0')^r$.
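The generalized decay equation for the cumulative q-Weibull can be checked with a central finite difference. This Python sketch assumes R_qw(x) = p0' exp_q'[-(x/x0')^r] with q' = 1/(2-q), x0' = x0/(2-q)^(1/r), p0' = p0/(2-q), and takes the coefficient as b = r p0'^(1-q') / x0'^r, which is what direct differentiation of that form gives. Parameter values and function names are arbitrary illustrations.

```python
import math


def q_exp(u, q):
    """exp_q(u) = [1 + (1 - q) u]^(1/(1 - q)); 0 past the cutoff, exp(u) at q = 1."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(u)
    base = 1.0 + (1.0 - q) * u
    return base ** (1.0 / (1.0 - q)) if base > 0.0 else 0.0


def primed_parameters(q, r, x0, p0):
    """(q', x0', p0') of the cumulative distribution, for q < 2."""
    return 1.0 / (2.0 - q), x0 / (2.0 - q) ** (1.0 / r), p0 / (2.0 - q)


def cumulative(x, q, r, x0, p0):
    """R_qw(x) = p0' * exp_q'(-(x / x0')^r)."""
    qp, x0p, p0p = primed_parameters(q, r, x0, p0)
    return p0p * q_exp(-((x / x0p) ** r), qp)


# Hypothetical parameter values, chosen only for the check.
q, r, x0, p0 = 1.3, 0.8, 2.0, 1.0
qp, x0p, p0p = primed_parameters(q, r, x0, p0)
b = r * p0p ** (1.0 - qp) / x0p ** r

x, h = 1.5, 1e-5
# Left-hand side: dR/dx by central difference.
lhs = (cumulative(x + h, q, r, x0, p0) - cumulative(x - h, q, r, x0, p0)) / (2.0 * h)
# Right-hand side: -b x^(r-1) R^q'.
rhs = -b * x ** (r - 1.0) * cumulative(x, q, r, x0, p0) ** qp

# Since R is the integral of the density from x to infinity, -dR/dx is the density itself.
density = p0 * r * x ** (r - 1.0) / x0 ** r * q_exp(-((x / x0) ** r), q)

print(lhs, rhs, -density)
```

Setting r = 1 in the same code gives the decay equation for the cumulative q-exponential, with coefficient a = p0'^(1-q') / x0'.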



FIG. 1: Basketball basket cumulative distribution for
two-point shots in the 1999 NBA championship. The
parameters employed in R(x) [Eq. (5)] are obtained from
those presented in Tab. I.

FIG. 2: Cumulative distribution of fatal victims of
tropical Atlantic cyclones. The parameters employed in
R(x) [Eq. (5)] are obtained from those presented in Tab. I.



III. EMPIRICAL ANALYSIS 

In this section, we apply the q-exponential, Weibull, and
q-Weibull distributions to analyze the data on basketball
baskets in a championship, tropical cyclone victims,
brand-name drugs by retail sales, and highway length. To
obtain the best set of values for the parameters, we use
the least-squares method. In addition, as a complementary
tool for assessing how good the fit is, we use residual
error analysis. In this kind of analysis, a good fit is
associated with a Gaussian frequency distribution of the
residuals and a random distribution of errors.
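The text does not say which minimization routine was used, so the following Python sketch only illustrates the idea of a least-squares fit followed by inspection of the residuals. It is an assumption-laden toy: the data are synthetic and noise-free, the search is an exhaustive grid over hypothetical values of q', x0', and r, and the amplitude p0' is obtained in closed form because the model is linear in it. None of the names or numbers come from the original analysis.

```python
import math


def q_exp(u, q):
    """exp_q(u) = [1 + (1 - q) u]^(1/(1 - q)); 0 past the cutoff, exp(u) at q = 1."""
    if abs(q - 1.0) < 1e-12:
        return math.exp(u)
    base = 1.0 + (1.0 - q) * u
    return base ** (1.0 / (1.0 - q)) if base > 0.0 else 0.0


def shape(x, qp, x0p, r):
    """Cumulative q-Weibull form without its amplitude: exp_q'(-(x / x0')^r)."""
    return q_exp(-((x / x0p) ** r), qp)


def fit_amplitude(xs, ys, qp, x0p, r):
    """Least-squares amplitude for fixed shape parameters (a linear problem)."""
    s = [shape(x, qp, x0p, r) for x in xs]
    denom = sum(v * v for v in s)
    return sum(v * y for v, y in zip(s, ys)) / denom if denom > 0.0 else 0.0


def residuals(xs, ys, qp, x0p, p0p, r):
    """Observed minus predicted values."""
    return [y - p0p * shape(x, qp, x0p, r) for x, y in zip(xs, ys)]


def grid_least_squares(xs, ys, q_grid, x0_grid, r_grid):
    """Exhaustive search; returns (sum of squares, q', x0', p0', r)."""
    best = None
    for qp in q_grid:
        for x0p in x0_grid:
            for r in r_grid:
                p0p = fit_amplitude(xs, ys, qp, x0p, r)
                sse = sum(e * e for e in residuals(xs, ys, qp, x0p, p0p, r))
                if best is None or sse < best[0]:
                    best = (sse, qp, x0p, p0p, r)
    return best


# Synthetic, noise-free data generated from known (hypothetical) parameters.
true_qp, true_x0p, true_p0p, true_r = 0.75, 240.0, 400.0, 1.0
xs = [10.0 * k for k in range(1, 80)]
ys = [true_p0p * shape(x, true_qp, true_x0p, true_r) for x in xs]

best = grid_least_squares(
    xs, ys,
    q_grid=[0.65, 0.70, 0.75, 0.80, 0.85],
    x0_grid=[200.0, 220.0, 240.0, 260.0, 280.0],
    r_grid=[0.8, 0.9, 1.0, 1.1],
)
sse, qp_fit, x0p_fit, p0p_fit, r_fit = best
errors = residuals(xs, ys, qp_fit, x0p_fit, p0p_fit, r_fit)
print(best)
```

With real data the list `errors` would then be examined in two ways, as described above: through its histogram (closeness to a Gaussian) and through a plot of predicted values against residuals (absence of systematic trends). A gradient-based or stochastic optimizer would normally replace the grid.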

Naturally, since the q-Weibull distribution has the
q-exponential and the Weibull ones as limiting cases, in
general it gives a better fit than the two others.
However, there are cases where the fit with the q-Weibull
distribution does not give a significant improvement when
compared with that obtained from one of the two other
distributions. In such cases, either the parameter r is
close to one, leading to a q-exponential distribution, or
the parameter q is close to one, leading to a Weibull
distribution. As we are going to verify below, there are
also cases where such limits are not a good approximation,
indicating that the q-Weibull distribution gives a
significant improvement in the fit. Now we describe the
systems to be analyzed.

FIG. 3: Cumulative distribution of brand-name drugs by
retail sales. The parameters employed in R(x) [Eq. (5)]
are obtained from those presented in Tab. I.

TABLE I: Fit parameters of the q-exponential, Weibull, and
q-Weibull distributions for the analyzed systems.

System      Parameter  q-exponential  Weibull       q-Weibull
Basketball  q          0.68           -             0.51
            r          -              1.15          0.91
            x_0        313            212           387
            p_0        520            387           601
Cyclone     q          1.58           -             0.95
            r          -              0.23          0.22
            x_0        24.9           4.41          6.39
            p_0        145            1.14 x 10^3   1.18 x 10^3
Drug        q          1.39           -             0.88
            r          -              0.35          0.25
            x_0        [missing]      1.30 x 10^4   7.03 x 10^3
            p_0        266            1.41 x 10^3   2.49 x 10^3
Highway     q          1.12           -             0.077
            r          -              0.87          0.57
            x_0        [missing]      712           2.96 x 10^3
            p_0        181            215           465

FIG. 4: Cumulative distributions of highway length in the
USA. The parameters employed in R(x) [Eq. (5)] are
obtained from those presented in Tab. I.
[Plot legend: Highway (circles), q-exponential, Weibull,
q-Weibull; horizontal axis x.]

It is natural to investigate aspects related to sports,
since they attract the interest of a large number of
people in our society. In fact, many studies concerning
sports have been carried out, for instance those related
to frequency distributions in soccer [13], baseball [28],
hockey, football, and basketball [29, 31]. Here, we
concentrate our attention on basketball. More precisely,
we consider the distribution of two-point baskets among
approximately four hundred players in the 1999 NBA
championship [32].

In our analysis of basketball baskets, we verified that
the q-exponential function gives a better fit than the
Weibull distribution, since the parameter r obtained from
the q-Weibull fit is close to one (r = 0.91). The
basketball basket cumulative distribution, as well as the
corresponding fits, is presented in Fig. 1. Tab. I
contains the parameters employed in the fits presented in
Fig. 1. We remark that in the previous examples of
q-exponential distributions (Refs. [12, 13, 14]) the
parameter q is larger than one, in contrast with the
present example, where q = 0.68. As far as we know, this
is the first system whose fit leads to a q-exponential
distribution with q < 1. Thus, our fit suggests the
existence of a cutoff $x_c$ for the maximum number of
baskets in the 1999 NBA championship. This cutoff,
$x_c = x_0'/(1-q') = 9.9 \times 10^2$, is compatible with
the true maximum number of baskets, 839. In contrast to
this case, we now focus on systems that are well described
by a Weibull distribution.
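The cutoff can be recomputed from the tabulated q-exponential parameters for basketball. The short Python sketch below assumes the relations q' = 1/(2-q) and x0' = x0/(2-q) (the r = 1 case of the primed parameters) and uses the rounded values q = 0.68 and x0 = 313 from Tab. I; the variable names are illustrative.

```python
# q-exponential column for basketball in Tab. I (rounded values).
q, x0 = 0.68, 313.0

# Cutoff of the density p_q(x): the bracket 1 - (1 - q) x / x0 vanishes here.
x_c_density = x0 / (1.0 - q)

# The same cutoff from the cumulative distribution, via the primed parameters.
q_prime = 1.0 / (2.0 - q)
x0_prime = x0 / (2.0 - q)
x_c_cumulative = x0_prime / (1.0 - q_prime)

print(x_c_density, x_c_cumulative)
```

Both expressions are algebraically identical, since x0'/(1-q') = x0/(1-q). With the rounded table entries they give roughly 9.8 x 10^2, in line with the value of about 9.9 x 10^2 quoted above (the small difference presumably reflects rounding of the tabulated parameters) and above the observed maximum of 839.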

Sudden inundations, such as those due to cyclones, are
known as the most significant natural catastrophes [33],
leading over the years to massive death and loss of
property. Studies of such phenomena include the spatial
and temporal distribution of cyclones for a given
region [34]. Furthermore, attempts to identify the
climatological processes that lead to cyclone formation
have employed meteorological satellites [35]. However,
since such technology has become available only recently,
it is difficult to investigate cyclones that occurred long
ago. Thus, in order to analyze at least in part the
effects of cyclones, including those that occurred many
years ago, we consider the number of fatal victims of
Atlantic tropical cyclones from 1492 to 1999 [36]. In this
period, two hundred and sixty-three Atlantic tropical
cyclones with deaths were catalogued.

In the analysis of the cumulative distribution of deaths
caused by tropical cyclones, the fit with a q-Weibull
distribution leads to q close to one (q = 0.95). This
indicates that the Weibull distribution is in good
agreement with the data. In Fig. 2, we plot the cumulative
distribution of cyclone fatal victims versus the number of
deaths, together with the three distributions employed in
this work. In Tab. I, we show the corresponding
parameters.

An important part of the expenditure in our society is
invested in health. In this context, the use of
pharmaceutical services plays a special role. This
contributes decisively to an enormous competition among
pharmaceutical industries, motivating the appearance of
new drugs every year. For these reasons, studies have been
performed, for instance, on the relationship between
pharmaceutical expenditures and income [37]. In this work,
we investigate another aspect, the distribution of
brand-name drugs. Here we consider the top two hundred
brand-name drugs by retail sales in 1999 in the USA [38].
As in the previous system, the fit with the q-Weibull
gives q close to one (q = 0.88), so the Weibull
distribution leads to a better fit than the q-exponential
one. This fact can be visualized in Fig. 3, and the
optimal parameters are presented in Tab. I.

In the above systems, the q-Weibull distribution does
not give a significant improvement, since the systems
can be well described by the q-exponential or the Weibull
distribution. But this may not occur in general, as the
following example shows.

In general, roads strategically connect different parts
of a country. In particular, understanding the myriad
aspects related to road transport is of great value, since
the economy of a region is strongly connected with its
transport infrastructure. Possible aspects related to road
transport are the study of mortality and morbidity due to
road accidents [39], and the influence of the length of
the road segment on the statistical description of
accident counts and density [40]. Here, we focus attention
on the distribution of highway length, by analyzing one
hundred and ninety-eight USA highways [41]. In Fig. 4, we
can see that neither the q-exponential nor the Weibull
distribution is able to give a satisfactory fit. In this
system, the interpolating q-Weibull distribution is
necessary to obtain a good fit. In fact, the parameters r
and q are not close to one (r = 0.57 and q = 0.077).

In every system, we applied the least-squares method to
obtain the optimal parameters. We also used residual error
analysis to confirm the goodness of the fits. To exemplify
this procedure, we present the residual analysis for the
highway length distribution in Fig. 5. The frequency
distribution of residuals and the plot of predicted versus
residual values for the q-Weibull distribution [Fig. 5(c)]
are better than the corresponding ones for the
q-exponential [Fig. 5(a)] and Weibull distributions
[Fig. 5(b)]. In fact, the histogram of the residual errors
is closer to a Gaussian for the q-Weibull distribution
than for the q-exponential and Weibull distributions. An
analogous improvement occurs in the randomness observed in
the plots of predicted versus residual values.

An alternative distribution that generalizes the
q-exponential one is the distribution that arises from the
differential equation [42]

$$\frac{dp}{dx} = -\mu p^r - (\lambda - \mu) p^q, \tag{6}$$

where q, r, μ, λ and the initial condition $p_0$ are real
parameters to be adjusted. The q-exponential distribution
is recovered if μ = 0 or if r = q. However, p(x) obtained
from Eq. (6) does not contain the Weibull distribution as
a particular case, which would be a desirable feature
since, as emphasized in this work, many systems are well
fitted by the Weibull distribution. The fact that Eq. (6)
does not contain the Weibull distribution as a particular
case is clear, since the latter satisfies the differential
equation

$$\frac{dp_w}{dx} = \left[\frac{r-1}{x} - \frac{r x^{r-1}}{x_0^r}\right] p_w, \tag{7}$$

whose coefficients depend explicitly on x.

In addition, we mention some practical difficulties in the full application of a distribution dictated by Eq. (6).
First, Eq. (6) does not have analytical solutions for all r and q, so a numerical approach is needed. Second,
it contains five parameters to be adjusted (one more than the q-Weibull). Due to these facts, the analysis and fitting
become cumbersome. On the other hand, when the distribution exhibits two slopes, the solution of Eq. (6) seems
appropriate, since neither the q-exponential nor the q-Weibull is able to reproduce this behavior. This characteristic has been
explored in the study of folded proteins [42]. Another possible application of Eq. (6) occurs when the distribution
presents a well-defined slope in the intermediate region. This situation can be found in linguistics, when investigating
deviations from the Zipf-Mandelbrot law [43]. Despite these difficulties (five parameters and numerical solution of
Eq. (6)), we have investigated the fit of our data by using p(x) from Eq. (6). We mention first that standard
fitting programs cannot be used, since we do not have an analytical form for the distribution from Eq. (6).
Furthermore, due to the large number of parameters, a method is needed that does not depend strongly on the initial set
of parameters and also converges quickly. In this direction, we applied the simulated annealing
method, as employed in Ref. [44], to obtain a good set of parameters. However, even this method was not good
enough to lead to a satisfactory result.
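To make the "numerical approach" concrete, the Python sketch below integrates dp/dx = -mu p^r - (lam - mu) p^q with a classical fourth-order Runge-Kutta scheme and checks it against the one case with a simple closed form, mu = 0, where the solution is the q-exponential p0 [1 - (1-q) lam p0^(q-1) x]^(1/(1-q)). The integrator, step count, and parameter values are assumptions made for illustration; the text does not state which numerical scheme was actually used.

```python
def rhs(p, q, r, mu, lam):
    """Right-hand side of dp/dx = -mu p^r - (lam - mu) p^q."""
    return -mu * p ** r - (lam - mu) * p ** q


def integrate(p0, q, r, mu, lam, x_max, n_steps=1000):
    """Classical fourth-order Runge-Kutta from x = 0; returns lists of x and p."""
    h = x_max / n_steps
    xs, ps = [0.0], [p0]
    p = p0
    for i in range(1, n_steps + 1):
        k1 = rhs(p, q, r, mu, lam)
        k2 = rhs(p + 0.5 * h * k1, q, r, mu, lam)
        k3 = rhs(p + 0.5 * h * k2, q, r, mu, lam)
        k4 = rhs(p + h * k3, q, r, mu, lam)
        p += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        xs.append(i * h)
        ps.append(p)
    return xs, ps


# Check against the mu = 0 case, whose solution is a q-exponential.
p0, q, lam = 1.0, 1.5, 1.0
xs, ps = integrate(p0, q, r=1.0, mu=0.0, lam=lam, x_max=10.0)
x_end = xs[-1]
exact = p0 * (1.0 - (1.0 - q) * lam * p0 ** (q - 1.0) * x_end) ** (1.0 / (1.0 - q))

print(ps[-1], exact)
```

In a fit, a routine like `integrate` would be called inside the objective function for every trial parameter set, which is part of what makes the five-parameter adjustment costly.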

FIG. 5: Residual distributions for the highway system when employing a) q-exponential, b) Weibull, and c) q-Weibull. The graphs
on the left show the frequency distribution of residuals and the graphs on the right show predicted versus residual values.



IV. CONCLUSION 

In several areas of nature, the Weibull and
Zipf-Mandelbrot (q-exponential with q > 1) distributions
play an important role in describing many frequency
distributions. Over a short range, both distributions can
lead to an apparent agreement. Thus, to decide which
distribution gives a better description, it is necessary
to consider a sufficiently large range. In this work, in
order to investigate this controversy, we focused on
systems with a sufficiently large range of data. Moreover,
to shed further light on this controversy, we introduced a
new distribution, the q-Weibull. This distribution
smoothly interpolates between the q-exponential and the
Weibull ones. When either the q-exponential or the Weibull
distribution is able to give a good description of a
system, the respective parameter, r or q, is close to one.
We applied the q-Weibull function to analyze distributions
of basketball baskets, cyclone victims, brand-name drugs
by retail sales, and highway length. We verified that the
basketball baskets distribution is well described by a
q-exponential with q < 1. For cyclone victims and
brand-name drugs by retail sales, the Weibull distribution
gives a good fit. On the other hand, for highway length,
only the q-Weibull distribution gives a satisfactory fit.
We hope that this new distribution can be useful in other
situations where neither the q-exponential nor the Weibull
distribution leads to a satisfactory result.

partial financial support.